COMPARISON OF CUTOFFS BETWEEN LAZY WALKS AND
MARKOVIAN SEMIGROUPS

GUAN-YU CHEN 1 AND LAURENT SALOFF-COSTE 2 



Abstract. We make a connection between the continuous time and lazy dis- 
crete time Markov chains through the comparison of cutoffs and mixing time 
in total variation distance. For illustration, we consider finite birth and death 
chains and provide a criterion on cutoffs using eigenvalues of the transition 
matrix. 



1. Introduction

Let $\Omega$ be a countable set and $(\Omega,K,\pi)$ be an irreducible Markov chain on $\Omega$ with transition matrix $K$ and stationary distribution $\pi$. Let

\[ H_t = e^{-t(I-K)} = \sum_{i=0}^{\infty} e^{-t}\frac{t^i}{i!}K^i \]

be the associated semigroup which describes the corresponding natural continuous time process on $\Omega$. For $\delta\in(0,1)$, set

\[ K_\delta = \delta I + (1-\delta)K, \tag{1.1} \]

where $I$ is the identity matrix indexed by $\Omega$. Clearly, $K_\delta$ is similar to $K$ but with an additional holding probability depending on $\delta$. We call $K_\delta$ the $\delta$-lazy walk or $\delta$-lazy chain of $K$. It is well-known that if $K$ is irreducible with stationary distribution $\pi$, then

\[ \lim_{m\to\infty} K_\delta^m(x,y) = \lim_{t\to\infty} H_t(x,y) = \pi(y), \quad \forall x,y\in\Omega,\ \delta\in(0,1). \]

In this paper, we consider convergence in total variation. The total variation distance between two probabilities $\mu,\nu$ on $\Omega$ is defined by $\|\mu-\nu\|_{\mathrm{TV}} = \sup\{\mu(A)-\nu(A) \mid A\subset\Omega\}$. For any irreducible $K$ with stationary distribution $\pi$, the (maximum) total variation distance is defined by

\[ d_{\mathrm{TV}}(m) = \sup_{x\in\Omega}\|K^m(x,\cdot)-\pi\|_{\mathrm{TV}}, \tag{1.2} \]

and the corresponding mixing time is given by

\[ T_{\mathrm{TV}}(\epsilon) = \inf\{m\ge 0 \mid d_{\mathrm{TV}}(m)\le\epsilon\}. \tag{1.3} \]

We write the total variation distance and mixing time as $d^{(c)}_{\mathrm{TV}}, T^{(c)}_{\mathrm{TV}}$ for the continuous semigroup and as $d^{(\delta)}_{\mathrm{TV}}, T^{(\delta)}_{\mathrm{TV}}$ for the $\delta$-lazy walk.

A sharp transition phenomenon, known as cutoff, was introduced by Aldous and Diaconis in the early 1980s. See e.g. [8, 5] for an introduction and a general review of cutoffs. In total variation, a family of irreducible Markov chains $(\Omega_n,K_n,\pi_n)_{n=1}^{\infty}$ is said to present a cutoff if

\[ \lim_{n\to\infty}\frac{T_{n,\mathrm{TV}}(\epsilon)}{T_{n,\mathrm{TV}}(\eta)} = 1, \quad \forall\, 0<\epsilon<\eta<1. \tag{1.4} \]

The family is said to present a $(t_n,b_n)$ cutoff if $b_n=o(t_n)$ and

\[ |T_{n,\mathrm{TV}}(\epsilon)-t_n| = O(b_n), \quad \forall\, 0<\epsilon<1. \]

The cutoff for the associated continuous semigroups is defined in a similar way. 
This paper contains the following general result. 

Theorem 1.1. Consider a family of irreducible and positive recurrent Markov chains $\mathcal{F}=\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$. For $\delta\in(0,1)$, let $\mathcal{F}_\delta$ be the family of associated $\delta$-lazy walks and let $\mathcal{F}_c$ be the family of associated continuous semigroups. Suppose $T^{(c)}_{n,\mathrm{TV}}(\epsilon_0)\to\infty$ for some $\epsilon_0\in(0,1)$. Then, the following are equivalent.

(1) $\mathcal{F}_\delta$ has a cutoff in total variation.

(2) $\mathcal{F}_c$ has a cutoff in total variation.

Furthermore, if $\mathcal{F}_c$ has a cutoff, then

\[ \lim_{n\to\infty}\frac{T^{(c)}_{n,\mathrm{TV}}(\epsilon)}{T^{(\delta)}_{n,\mathrm{TV}}(\epsilon)} = 1-\delta, \quad \forall\epsilon\in(0,1). \]

Theorem 1.2. Let $\mathcal{F}$ be the family in Theorem 1.1. Assume that $t_n\to\infty$. Then, the following are equivalent.

(1) $\mathcal{F}_c$ has a $(t_n,b_n)$ cutoff.

(2) For $\delta\in(0,1)$, $\mathcal{F}_\delta$ has a $(t_n/(1-\delta),b_n)$ cutoff.

We refer the readers to Theorems 3.1, 3.4, 3.5 and 3.7 for more detailed discussions.

For an illustration, we consider finite birth and death chains. For $n\ge 1$, let $\Omega_n=\{0,1,...,n\}$ and $K_n$ be the transition kernel of a birth and death chain on $\Omega_n$ with birth rate $p_{n,i}$, death rate $q_{n,i}$ and holding rate $r_{n,i}$, where $p_{n,n}=q_{n,0}=0$ and $p_{n,i}+q_{n,i}+r_{n,i}=1$. Suppose that $K_n$ is irreducible with stationary distribution $\pi_n$. For the family $\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$, Ding et al. [10] showed that, in the discrete time case, if $\inf_{i,n} r_{n,i}>0$, then the cutoff in total variation exists if and only if the product of the total variation mixing time and the spectral gap, which is defined to be the smallest non-zero eigenvalue of $I-K$, tends to infinity. There is also a similar version for the continuous time case. The next theorem is an application of the above result and Theorem 1.1, which is summarized from Theorem 4.10.

Theorem 1.3. Let $\mathcal{F}=\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$ be a family of irreducible birth and death chains as above. For $n\ge 1$, let $0,\lambda_{n,1},...,\lambda_{n,n}$ be eigenvalues of $I-K_n$ and set

\[ \lambda_n = \min_{1\le i\le n}\lambda_{n,i}, \qquad s_n = \sum_{i=1}^{n}\lambda_{n,i}^{-1}. \]

Then, the following are equivalent.

(1) $\mathcal{F}_c$ has a total variation cutoff.

(2) For $\delta\in(0,1)$, $\mathcal{F}_\delta$ has a total variation cutoff.

(3) $s_n\lambda_n\to\infty$.
The remainder of this article is organized as follows. In Section 2, the concepts of cutoffs and mixing times are introduced and fundamental results are reviewed. In Section 3, a detailed comparison of the cutoff time and window size is made between the continuous time and lazy discrete time cases, where the state space is allowed to be infinite. In Section 4, we focus on finite birth and death chains and provide a criterion on total variation cutoffs using the eigenvalues of the transition matrices.

2. Cutoffs in total variation 

Throughout this paper, for any two sequences $s_n,t_n$ of positive numbers, we write $s_n=O(t_n)$ if there are $C>0$, $N>0$ such that $|s_n|\le C|t_n|$ for $n\ge N$. If $s_n=O(t_n)$ and $t_n=O(s_n)$, we write $s_n\asymp t_n$. If $t_n/s_n\to 1$ as $n\to\infty$, we write $t_n\sim s_n$.

Consider the following definitions.

Definition 2.1. Referring to the notation in (1.2), a family $\mathcal{F}=\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$ is said to present a total variation

(1) precutoff if there is a sequence $t_n$ and $B>A>0$ such that

\[ \lim_{n\to\infty} d_{n,\mathrm{TV}}(\lceil Bt_n\rceil)=0, \qquad \liminf_{n\to\infty} d_{n,\mathrm{TV}}(\lfloor At_n\rfloor)>0. \]

(2) cutoff if there is a sequence $t_n$ such that, for all $\epsilon>0$,

\[ \lim_{n\to\infty} d_{n,\mathrm{TV}}(\lceil(1+\epsilon)t_n\rceil)=0, \qquad \lim_{n\to\infty} d_{n,\mathrm{TV}}(\lfloor(1-\epsilon)t_n\rfloor)=1. \]

(3) $(t_n,b_n)$ cutoff if $b_n=o(t_n)$ and

\[ \lim_{c\to\infty}\overline{F}(c)=0, \qquad \lim_{c\to-\infty}\underline{F}(c)=1, \]

where

\[ \overline{F}(c)=\limsup_{n\to\infty} d_{n,\mathrm{TV}}(\lceil t_n+cb_n\rceil), \qquad \underline{F}(c)=\liminf_{n\to\infty} d_{n,\mathrm{TV}}(\lfloor t_n+cb_n\rfloor). \]

In Definition 2.1, $t_n$ is called a cutoff time and $b_n$ is called a window for $t_n$. Cutoffs for continuous semigroups are defined in the same way, except that $\lceil\cdot\rceil$ and $\lfloor\cdot\rfloor$ are deleted.

Remark 2.1. In Definition 2.1, if $t_n\to\infty$ (or equivalently $T_{n,\mathrm{TV}}(\epsilon)\to\infty$ for some $\epsilon\in(0,1)$), then the cutoff is consistent with (1.4). This is also true for cutoffs in continuous semigroups without the assumption $t_n\to\infty$.

The following lemma characterizes the total variation convergence using specific 
subsequences of indices and events, which is useful in proving and disproving cutoffs. 

Lemma 2.1. Consider a family of irreducible and positive recurrent Markov chains $\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$. Let $t_n$ be a sequence of nonnegative integers. Then, the following are equivalent.

(1) $d_{n,\mathrm{TV}}(t_n)\to 0$.

(2) For any increasing sequence of positive integers $n_k$, any $A_{n_k}\subset\Omega_{n_k}$ and any $x_{n_k}\in\Omega_{n_k}$, there is a subsequence $m_k$ such that

\[ \lim_{k\to\infty}\left|K_{m_k}^{t_{m_k}}(x_{m_k},A_{m_k})-\pi_{m_k}(A_{m_k})\right| = 0. \]
Proof of Lemma 2.1. (1)$\Rightarrow$(2) is obvious. For (2)$\Rightarrow$(1), choose $A_n\subset\Omega_n$ and $x_n\in\Omega_n$ such that $d_{n,\mathrm{TV}}(t_n)\le 2|K_n^{t_n}(x_n,A_n)-\pi_n(A_n)|$. Let $n_k$ be an increasing sequence of positive integers and choose a subsequence $m_k$ such that

\[ \lim_{k\to\infty}\left|K_{m_k}^{t_{m_k}}(x_{m_k},A_{m_k})-\pi_{m_k}(A_{m_k})\right| = 0. \]

This implies $d_{m_k,\mathrm{TV}}(t_{m_k})\to 0$, as desired. □

Remark 2.2. Lemma 2.1 also holds in continuous time if $t_n$ is allowed to take positive real values. See [4, 5] for further discussions on cutoffs.

3. Comparisons of cutoffs 

In this section, we establish the relation of cutoffs between lazy walks and continuous semigroups. Let $\Omega$ be a countable set and $K$ be a transition matrix indexed by $\Omega$. In the notation of (1.1), the $\delta$-lazy walk evolves in accordance with

\[ (K_\delta)^t = \sum_{i=0}^{t}\binom{t}{i}\delta^{t-i}(1-\delta)^i K^i, \quad \forall\delta\in(0,1),\ t\ge 0, \]

whereas the continuous time chain follows

\[ H_t = e^{-t(I-K)} = \sum_{i=0}^{\infty} e^{-t}\frac{t^i}{i!}K^i. \]

Observe that $I-K=(I-K_\delta)/(1-\delta)$. This implies

\[ d^{(c)}_{\mathrm{TV}}(t) \le e^{-t/(1-\delta)}\sum_{m=0}^{\infty}\frac{[t/(1-\delta)]^m}{m!}\,d^{(\delta)}_{\mathrm{TV}}(m). \tag{3.1} \]
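The identity $I-K=(I-K_\delta)/(1-\delta)$ behind (3.1) says that running the semigroup of $K$ for time $t$ is the same as running the semigroup of $K_\delta$ for time $t/(1-\delta)$. The following is a minimal numerical sketch of that statement, not part of the original argument; the three-state matrix `K`, the values of `delta` and `t`, and the helper names are all illustrative assumptions.

```python
import math

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lazy(K, delta):
    """The delta-lazy matrix K_delta = delta * I + (1 - delta) * K, as in (1.1)."""
    n = len(K)
    return [[delta * (i == j) + (1 - delta) * K[i][j] for j in range(n)]
            for i in range(n)]

def poisson_mix(K, t, terms=200):
    """Truncated series sum_{i < terms} e^{-t} t^i / i! * K^i approximating e^{-t(I-K)}."""
    n = len(K)
    total = [[0.0] * n for _ in range(n)]
    power = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    weight = math.exp(-t)  # Poisson(t) probability of i = 0
    for i in range(terms):
        for r in range(n):
            for c in range(n):
                total[r][c] += weight * power[r][c]
        power = mat_mul(power, K)
        weight *= t / (i + 1)  # move to the Poisson(t) probability of i + 1
    return total

# Hypothetical example: simple walk on {0, 1, 2} with reflecting ends.
K = [[0.0, 1.0, 0.0],
     [0.5, 0.0, 0.5],
     [0.0, 1.0, 0.0]]
delta, t = 0.3, 2.0

H_direct = poisson_mix(K, t)                                # series in K at time t
H_via_lazy = poisson_mix(lazy(K, delta), t / (1 - delta))   # series in K_delta at time t/(1-delta)
max_gap = max(abs(H_direct[i][j] - H_via_lazy[i][j])
              for i in range(3) for j in range(3))
```

In this sketch `max_gap` is only a floating point residue, which is the matrix identity from which (3.1) follows by taking total variation distances term by term.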

Concerning the cutoff times and windows, we discuss each of them in detail. 

3.1. Cutoff times. 

Theorem 3.1. Let $\mathcal{F}=\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$ be a family of irreducible Markov chains on countable state spaces with stationary distributions. For $\delta\in(0,1)$, let $\mathcal{F}_\delta=\{(\Omega_n,K_{n,\delta},\pi_n) \mid n=1,2,...\}$ and $\mathcal{F}_c=\{(\Omega_n,H_{n,t},\pi_n) \mid n=1,2,...\}$. Suppose there is $\epsilon_0>0$ such that $T^{(\delta)}_{n,\mathrm{TV}}(\epsilon_0)\to\infty$ or $T^{(c)}_{n,\mathrm{TV}}(\epsilon_0)\to\infty$. Then, the following are equivalent.

(1) $\mathcal{F}_\delta$ has a cutoff (resp. precutoff) in total variation.

(2) $\mathcal{F}_c$ has a cutoff (resp. precutoff) in total variation.

Furthermore, if $\mathcal{F}_c$ has a cutoff, then

\[ \lim_{n\to\infty}\frac{T^{(c)}_{n,\mathrm{TV}}(\epsilon)}{T^{(\delta)}_{n,\mathrm{TV}}(\epsilon)} = 1-\delta, \quad \forall\epsilon\in(0,1). \]

The above theorem is in fact a simple corollary of the following proposition. 



Proposition 3.2. Let $\mathcal{F}_\delta,\mathcal{F}_c$ be families in Theorem 3.1 and $t_n,r_n$ be sequences tending to infinity. Fix $\delta\in(0,1)$.

(1) If $d^{(\delta)}_{n,\mathrm{TV}}(\lceil t_n\rceil)\to 0$, then

\[ \lim_{n\to\infty} d^{(c)}_{n,\mathrm{TV}}((1-\delta)t_n+cb_n) = 0, \]

for all $c>0$ and for any sequence $b_n$ satisfying $\sqrt{t_n}=o(b_n)$.

(2) If $d^{(c)}_{n,\mathrm{TV}}(r_n)\to 0$, then

\[ \lim_{n\to\infty} d^{(\delta)}_{n,\mathrm{TV}}(\lceil r_n/(1-\delta)+cb_n\rceil) = 0, \]

for all $c>0$ and for any sequence $b_n$ satisfying $\sqrt{r_n}=o(b_n)$.

(3) If $d^{(c)}_{n,\mathrm{TV}}(r_n)\to 1$, then

\[ \lim_{n\to\infty} d^{(\delta)}_{n,\mathrm{TV}}(\lfloor r_n/(1-\delta)\rfloor) = 1. \]

(4) If $d^{(\delta)}_{n,\mathrm{TV}}(\lfloor t_n\rfloor)\to 1$, then

\[ \lim_{n\to\infty} d^{(c)}_{n,\mathrm{TV}}((1-\delta)t_n) = 1. \]

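Proof. We prove (1), while (2) goes in a similar way and is omitted. Suppose $d^{(\delta)}_{n,\mathrm{TV}}(\lceil t_n\rceil)\to 0$. Since $\sqrt{t_n}=o(b_n)$, it is clear that

\[ \lim_{n\to\infty} d^{(\delta)}_{n,\mathrm{TV}}(\lceil t_n+cb_n+c'\sqrt{t_n}\rceil) = 0, \quad \forall c>0,\ c'\in\mathbb{R}. \tag{3.2} \]

Fix $c>0$ and let $x_n\in\Omega_n$, $A_n\subset\Omega_n$. Write $\tau_n=(1-\delta)(t_n+cb_n)$. Given any increasing sequence $n_l$, we may choose, according to Lemma 3.8 applied with $\tau_n$ in place of $t_n$ and $a_{n,m}=K_n^m(x_n,A_n)$, a subsequence $m_l$ such that $\pi_{m_l}(A_{m_l})\to a\in[0,1]$ and, for all $c'\in\mathbb{R}$,

\[ \lim_{l\to\infty} K_{m_l,\delta}^{\lceil(\tau_{m_l}+c'\sqrt{\tau_{m_l}})/(1-\delta)\rceil}(x_{m_l},A_{m_l}) = \frac{1}{\sqrt{2\pi\delta}}\int_{-\infty}^{\infty} e^{-(x-c')^2/(2\delta)}f(x)\,dx \]

and

\[ \lim_{l\to\infty} H_{m_l,(1-\delta)(t_{m_l}+cb_{m_l})}(x_{m_l},A_{m_l}) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} e^{-x^2/2}f(x)\,dx, \]

where $f$ is nonnegative and bounded by 1. By (3.2) and Lemma 3.9, $f$ equals $a$ almost everywhere and, by Lemma 2.1, this implies $d^{(c)}_{n,\mathrm{TV}}((1-\delta)t_n+cb_n)\to 0$ as $n\to\infty$ for all $c>0$.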

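The proofs for (3) and (4) are similar and we only give the details for (4). First, we choose sequences $x_n\in\Omega_n$ and $A_n\subset\Omega_n$ such that

\[ \lim_{n\to\infty}\pi_n(A_n)=1, \qquad \lim_{n\to\infty} K_{n,\delta}^{\lfloor t_n\rfloor}(x_n,A_n)=0. \]

Let $n_l$ be a sequence tending to infinity. Applying Lemma 3.8 with $c=0$ and $a_{n,m}=K_n^m(x_n,A_n)$, we may choose a subsequence, say $m_l$, such that

\[ \lim_{l\to\infty} H_{m_l,(1-\delta)t_{m_l}}(x_{m_l},A_{m_l}) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} e^{-x^2/2}g(x)\,dx \]

and

\[ \lim_{l\to\infty} K_{m_l,\delta}^{\lfloor t_{m_l}\rfloor}(x_{m_l},A_{m_l}) = \frac{1}{\sqrt{2\pi\delta}}\int_{-\infty}^{\infty} e^{-x^2/(2\delta)}g(x)\,dx, \]

where $g$ is a nonnegative measurable function bounded by 1. This leads to $g=0$ almost everywhere and

\[ \lim_{l\to\infty} d^{(c)}_{m_l,\mathrm{TV}}((1-\delta)t_{m_l}) = 1. \]

□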



The following is a simple corollary of Proposition 3.2 (1)-(2).

Corollary 3.3. Let $\mathcal{F}_\delta,\mathcal{F}_c$ be families in Theorem 3.1 and $t_n,r_n$ be sequences tending to infinity. Fix $\delta\in(0,1)$.

(1) If $d^{(\delta)}_{n,\mathrm{TV}}(\lceil t_n\rceil)\to 0$, then

\[ \lim_{n\to\infty} d^{(c)}_{n,\mathrm{TV}}((1+\epsilon)(1-\delta)t_n) = 0, \quad \forall\epsilon>0. \]

(2) If $d^{(c)}_{n,\mathrm{TV}}(r_n)\to 0$, then

\[ \lim_{n\to\infty} d^{(\delta)}_{n,\mathrm{TV}}(\lceil(1+\epsilon)r_n/(1-\delta)\rceil) = 0, \quad \forall\epsilon>0. \]

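Proof of Theorem 3.1. Set $r_n=T^{(\delta)}_{n,\mathrm{TV}}(\epsilon_0)$ and $s_n=T^{(c)}_{n,\mathrm{TV}}(\epsilon_0)$. Suppose $r_n\to\infty$. By Corollary 3.3 (2), if

\[ \liminf_{n\to\infty} d^{(c)}_{n,\mathrm{TV}}((1-\delta)r_n/2) = 0, \]

then

\[ \liminf_{n\to\infty} d^{(\delta)}_{n,\mathrm{TV}}(\lceil(1+\epsilon)r_n/2\rceil) = 0, \quad \forall\epsilon>0. \]

But, taking $\epsilon=1/2$ implies that, for $n$ large enough,

\[ d^{(\delta)}_{n,\mathrm{TV}}(\lceil(1+\epsilon)r_n/2\rceil) \ge d^{(\delta)}_{n,\mathrm{TV}}(r_n-1) > \epsilon_0 > 0. \]

This makes a contradiction and, hence, if $r_n\to\infty$, then

\[ \liminf_{n\to\infty} d^{(c)}_{n,\mathrm{TV}}((1-\delta)r_n/2) > 0. \]

In a similar way, if $s_n\to\infty$, then Corollary 3.3 (1) implies

\[ \liminf_{n\to\infty} d^{(\delta)}_{n,\mathrm{TV}}(\lceil s_n\rceil) > 0. \]

This proves the following equivalence.

\[ T^{(\delta)}_{n,\mathrm{TV}}(\epsilon_0)\to\infty \text{ for some } \epsilon_0>0 \quad\Leftrightarrow\quad T^{(c)}_{n,\mathrm{TV}}(\epsilon_0)\to\infty \text{ for some } \epsilon_0>0. \]

For the equivalence of (1) and (2), the proof for precutoffs is given by Corollary 3.3 (1)-(2), while the proof for cutoffs also uses Proposition 3.2 (3)-(4). □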

3.2. Cutoff windows. This section is devoted to the comparison of cutoff windows introduced in Definition 2.1.

Theorem 3.4. Let $\mathcal{F}$ be a family of irreducible positive recurrent Markov chains and $\mathcal{F}_\delta,\mathcal{F}_c$ be associated families of lazy walks and continuous semigroups. Let $t_n,b_n$ be sequences of positive reals and assume that $t_n\to\infty$. If $\mathcal{F}_\delta$ (resp. $\mathcal{F}_c$) presents a $(t_n,b_n)$ cutoff in total variation, then $\sqrt{t_n}=O(b_n)$.

Remark 3.1. There are examples with cutoffs in which the order of any window must be larger than $\sqrt{t_n}$. Consider the Ehrenfest chain on $\{0,...,n\}$, which is a birth and death chain with rates $p_{n,i}=1-i/n$, $q_{n,i}=i/n$ and $r_{n,i}=0$. It is obvious that $K_n$ is irreducible and periodic with stationary distribution $\pi_n(i)=2^{-n}\binom{n}{i}$. An application of the representation theory shows that, for $0\le i\le n$, $2i/n$ is an eigenvalue of $I-K_n$. Let $\lambda_n=2/n$ and $s_n=\sum_{i=1}^{n} n/(2i)=\frac{1}{2}n\log n+O(n)$. By Theorem 4.1, since $\lambda_n s_n$ tends to infinity, both $\mathcal{F}_c$ and $\mathcal{F}_\delta$ have a total variation cutoff. For a detailed computation on the total variation and the $L^2$-distance, see e.g. [7]. It is well-known that $\mathcal{F}_c$ has a $(\frac{1}{4}n\log n, n)$ total variation cutoff. By Theorem 3.5, $\mathcal{F}_\delta$ has a $(\frac{n\log n}{4(1-\delta)}, n)$ total variation cutoff for $\delta\in(0,1)$, which is nontrivial. For the continuous time Ehrenfest chains, Theorem 3.4 says that the window size is at least $\sqrt{n\log n}$, while $n$ is the correct order.
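The quantities in the Ehrenfest example can be evaluated directly from the quoted spectrum $2i/n$. The sketch below is illustrative only; the function names are hypothetical, and the cutoff time formula is simply the one stated in the remark.

```python
import math

def ehrenfest_spectrum(n):
    """Non-zero eigenvalues 2i/n, i = 1..n, of I - K_n as quoted in the remark."""
    return [2.0 * i / n for i in range(1, n + 1)]

def gap_times_s(n):
    """lambda_n * s_n for the Ehrenfest chain; algebraically the harmonic number H_n."""
    eigs = ehrenfest_spectrum(n)
    lam = min(eigs)                      # spectral gap 2/n
    s = sum(1.0 / x for x in eigs)       # s_n = sum of n/(2i)
    return lam * s

def lazy_cutoff_time(n, delta):
    """Cutoff time n log n / (4 (1 - delta)) of the delta-lazy Ehrenfest chain."""
    return n * math.log(n) / (4.0 * (1.0 - delta))

def window_lower_bound(n):
    """Order sqrt(n log n) of the window lower bound given by Theorem 3.4."""
    return math.sqrt(n * math.log(n))
```

Since `gap_times_s(n)` is the $n$-th harmonic number, it grows like $\log n$, which is the divergence required by the spectral criterion; `window_lower_bound(n)` stays below the true window order $n$ for large $n$.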
Proof of Theorem 3.4. We prove the continuous time case. The lazy discrete time case can be treated similarly. Assume the contrary, that the sequence $\sqrt{t_n}/b_n$ is not bounded. By considering a subsequence of $\sqrt{t_n}/b_n$ which tends to infinity, it loses no generality to assume that $b_n=o(\sqrt{t_n})$. According to the definition of cutoffs, we may choose $C>0$, $x_n\in\Omega_n$ and $A_n\subset\Omega_n$ such that

\[ \liminf_{n\to\infty}\left|H_{n,t_n+Cb_n}(x_n,A_n)-\pi_n(A_n)\right| > 0. \]

By Lemma 3.8, one may choose a sequence $n_l$ tending to infinity such that $\pi_{n_l}(A_{n_l})$ converges to $a\in[0,1]$ and

\[ \lim_{l\to\infty} H_{n_l,t_{n_l}+Cb_{n_l}}(x_{n_l},A_{n_l}) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} e^{-x^2/2}f(x)\,dx \ne a, \]

where $f$ is nonnegative and bounded by 1. Let $c\in\mathbb{R}$. For any $\epsilon>0$, choose $N>0$ such that, for $n\ge N$,

\[ \left| H_{n,t_n+cb_n}(x_n,A_n) - \sum_{i:|i-t_n|\le N\sqrt{t_n}} e^{-(t_n+cb_n)}\frac{(t_n+cb_n)^i}{i!}K_n^i(x_n,A_n) \right| < \epsilon. \]

Note that

\[ e^{-(t_n+cb_n)}\frac{(t_n+cb_n)^i}{i!} = e^{-(t_n+Cb_n)}\frac{(t_n+Cb_n)^i}{i!}\,(1+o(1)), \]

where $o(1)$ is uniform for $|i-t_n|\le N\sqrt{t_n}$. This implies

\[ \lim_{l\to\infty} H_{n_l,t_{n_l}+cb_{n_l}}(x_{n_l},A_{n_l}) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} e^{-x^2/2}f(x)\,dx, \quad \forall c\in\mathbb{R}. \]

Since $\mathcal{F}_c$ presents a $(t_n,b_n)$ cutoff, the right-side integral is equal to $a$, a contradiction. □



Theorem 3.5. Let $\mathcal{F}_\delta,\mathcal{F}_c$ be families in Theorem 3.4 and $t_n\to\infty$. Then, the following are equivalent.

(1) $\mathcal{F}_\delta$ has a $(t_n,b_n)$ cutoff.

(2) $\mathcal{F}_c$ has a $((1-\delta)t_n,b_n)$ cutoff.

To prove this theorem, we need the following proposition.

Proposition 3.6. Let $\mathcal{F}_\delta,\mathcal{F}_c$ be as in Theorem 3.5 and $t_n,r_n$ be sequences tending to infinity.

(1) If $\mathcal{F}_\delta$ has a $(t_n,b_n)$ cutoff, then $\mathcal{F}_c$ has a $((1-\delta)t_n,d_n)$ cutoff for any sequence satisfying $d_n=o(t_n)$ and $b_n=o(d_n)$.

(2) If $\mathcal{F}_c$ has a $(r_n,b_n)$ cutoff, then $\mathcal{F}_\delta$ has a $(r_n/(1-\delta),d_n)$ cutoff for any sequence satisfying $d_n=o(r_n)$ and $b_n=o(d_n)$.

Proof. Immediate from Theorem 3.4 and Proposition 3.2. □

Proof of Theorem 3.5. We prove (1)$\Rightarrow$(2), while the reasoning for (2)$\Rightarrow$(1) is similar. Suppose that $\mathcal{F}_\delta$ has a $(t_n,b_n)$ cutoff with $t_n\to\infty$. Fix $\epsilon\in(0,1)$ and set $c_n=|T^{(c)}_{n,\mathrm{TV}}(\epsilon)-(1-\delta)t_n|$. By [5, Proposition 2.3], it remains to show that $c_n=O(b_n)$. Assume the contrary, that is, there is a subsequence $\xi=\{n_l \mid l=1,2,...\}$ such that $c_{n_l}/b_{n_l}\to\infty$ as $l\to\infty$. Let $\mathcal{F}_\delta(\xi),\mathcal{F}_c(\xi)$ be the families $\mathcal{F}_\delta,\mathcal{F}_c$ restricted to $\xi$. This implies $\mathcal{F}_\delta(\xi)$ has a $(t_{n_l},b_{n_l})$ cutoff, but $\mathcal{F}_c(\xi)$ has no $((1-\delta)t_{n_l},\sqrt{b_{n_l}c_{n_l}})$ cutoff, a contradiction with Proposition 3.6. □




3.3. Chains with specified initial states. For any probability $\mu$ on a countable set $\Omega$, we write $(\mu,\Omega,K,\pi)$ for an irreducible Markov chain on $\Omega$ with transition matrix $K$, stationary distribution $\pi$ and initial distribution $\mu$. The total variation distances for the associated $\delta$-lazy walk and continuous time chain are defined by

\[ d^{(\delta)}_{\mathrm{TV}}(\mu,m) = \|\mu K_\delta^m-\pi\|_{\mathrm{TV}}, \qquad d^{(c)}_{\mathrm{TV}}(\mu,t) = \|\mu H_t-\pi\|_{\mathrm{TV}}. \tag{3.3} \]

We denote by $T^{(\delta)}_{\mathrm{TV}}(\mu,\epsilon)$, $T^{(c)}_{\mathrm{TV}}(\mu,\epsilon)$ the corresponding mixing times, and the concept of cutoffs can be defined similarly to Definition 2.1 according to (3.3). It is an easy exercise to obtain a similar version of Lemma 2.1 for cutoffs with specified initial distributions. The proofs for Propositions 3.2, 3.6 and Corollary 3.3 can be adapted to the case when the initial distribution is prescribed. This gives the following theorems.

Theorem 3.7. Let $\mathcal{F}=\{(\mu_n,\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$ be a family of irreducible Markov chains and $\mathcal{F}_\delta,\mathcal{F}_c$ be families of associated $\delta$-lazy walks and continuous time chains.

(1) $\mathcal{F}_\delta$ has a cutoff (resp. precutoff) iff $\mathcal{F}_c$ has a cutoff (resp. precutoff).

(2) If $\mathcal{F}_\delta$ has a cutoff, then $T^{(c)}_{n,\mathrm{TV}}(\mu_n,\epsilon)\sim(1-\delta)T^{(\delta)}_{n,\mathrm{TV}}(\mu_n,\epsilon)$ as $n$ tends to $\infty$ for all $\epsilon\in(0,1)$.

Let $t_n\to\infty$ and $b_n>0$.

(3) $\mathcal{F}_\delta$ has a $(t_n,b_n)$ cutoff iff $\mathcal{F}_c$ has a $((1-\delta)t_n,b_n)$ cutoff.

(4) If $\mathcal{F}_\delta$ has a $(t_n,b_n)$ cutoff, then $\sqrt{t_n}=O(b_n)$.

3.4. Proofs. This subsection collects the techniques required for the proofs of the theorems in Sections 3.1-3.3.

Lemma 3.8. Let $a_{n,m}\in[0,1]$, $t_n>0$ and $c\in\mathbb{R}$. Suppose that $t_n\to\infty$. Then, there is a subsequence $n_k$ of positive integers and a nonnegative measurable function $f$ bounded by 1 such that

\[ \lim_{k\to\infty}\sum_{m=0}^{\infty}\left(e^{-t_{n_k}-c\sqrt{t_{n_k}}}\frac{(t_{n_k}+c\sqrt{t_{n_k}})^m}{m!}\right)a_{n_k,m} = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} e^{-(x-c)^2/2}f(x)\,dx, \]

for all $c\in\mathbb{R}$, and

\[ \lim_{k\to\infty}\sum_{m\ge 0}\binom{[(t_{n_k}+c\sqrt{t_{n_k}})/(1-\delta)]}{m}(1-\delta)^m\delta^{[(t_{n_k}+c\sqrt{t_{n_k}})/(1-\delta)]-m}a_{n_k,m} = \frac{1}{\sqrt{2\pi\delta}}\int_{-\infty}^{\infty} e^{-(x-c)^2/(2\delta)}f(x)\,dx, \]

for all $c\in\mathbb{R}$, $\delta\in(0,1)$, where $[z]$ is any of $\lceil z\rceil$, $\lfloor z\rfloor$.

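Proof. For $n\ge 1$ and any Borel set $A\subset\mathbb{R}$, set

\[ \mu_n(A) = \frac{1}{\sqrt{t_n}}\sum_{m:(m-t_n)/\sqrt{t_n}\in A} a_{n,m}. \]

Let $n_k$ be a subsequence of $\mathbb{N}$ such that

\[ \lim_{k\to\infty}\mu_{n_k}((a,b]) = \mu((a,b]), \quad \forall a,b\in\mathbb{Q},\ a<b. \tag{3.4} \]

Clearly, $\mu((a,b])\le b-a$ for $a<b$ and $a,b\in\mathbb{Q}$. This implies the convergence in (3.4) holds for all $a<b$ and $\mu((a,b])\le b-a$. As a consequence of the Carathéodory extension theorem, $\mu$ can be extended to a measure on $\mathbb{R}$. It is obvious that $\mu$ is absolutely continuous with respect to the Lebesgue measure and we write $f$ for the Radon-Nikodym derivative.

Let $\epsilon>0$ and choose $M>0$ such that, for $n\ge M$,

\[ \sum_{m:(m-t_n)/\sqrt{t_n}\notin(-M,M]} e^{-t_n-c\sqrt{t_n}}\frac{(t_n+c\sqrt{t_n})^m}{m!} < \epsilon. \]

For any integer $N>1$, set $x_i=iM/N$ and $A_{n,i}=\{m\ge 0 \mid (m-t_n)/\sqrt{t_n}\in(x_i,x_{i+1}]\}$. By Stirling's formula, it is easy to see that

\[ e^{-t_n-c\sqrt{t_n}}\frac{(t_n+c\sqrt{t_n})^m}{m!} = \frac{1+o(1)}{\sqrt{2\pi t_n}}\exp\left\{-\frac{1}{2}\left(\frac{m-t_n}{\sqrt{t_n}}-c\right)^2\right\} \quad\text{as } n\to\infty, \]

where $o(1)$ is uniform for $m\in A_{n,i}$ and $-N\le i<N$. This implies

\[ \sum_{m\in A_{n,i}}\left(e^{-t_n-c\sqrt{t_n}}\frac{(t_n+c\sqrt{t_n})^m}{m!}\right)a_{n,m} \begin{cases} \le U_i\,\mu_n((x_i,x_{i+1}])/\sqrt{2\pi}+o(1), \\ \ge L_i\,\mu_n((x_i,x_{i+1}])/\sqrt{2\pi}+o(1), \end{cases} \]

where $U_i=\sup\{e^{-(x-c)^2/2} \mid x\in(x_i,x_{i+1}]\}$ and $L_i=\inf\{e^{-(x-c)^2/2} \mid x\in(x_i,x_{i+1}]\}$. Summing up $i$ and replacing $n$ with $n_k$ yields

\[ \limsup_{k\to\infty}\sum_{m=0}^{\infty}\left(e^{-t_{n_k}-c\sqrt{t_{n_k}}}\frac{(t_{n_k}+c\sqrt{t_{n_k}})^m}{m!}\right)a_{n_k,m} \le \frac{1}{\sqrt{2\pi}}\sum_{i=-N}^{N-1}U_i\,\mu((x_i,x_{i+1}])+\epsilon \]

and

\[ \liminf_{k\to\infty}\sum_{m=0}^{\infty}\left(e^{-t_{n_k}-c\sqrt{t_{n_k}}}\frac{(t_{n_k}+c\sqrt{t_{n_k}})^m}{m!}\right)a_{n_k,m} \ge \frac{1}{\sqrt{2\pi}}\sum_{i=-N}^{N-1}L_i\,\mu((x_i,x_{i+1}])-\epsilon. \]

Letting $N\to\infty$ and then $\epsilon\to 0$ gives the desired limit. The proof of the second limit is similar and omitted. □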

Lemma 3.9. Let $f$ be a bounded nonnegative measurable function and set $F(t)=\int_{-\infty}^{\infty} e^{-(x-t)^2}f(x)\,dx$. If $F$ is constant, then $f$ is constant almost everywhere.

Proof. Set $A=F(t)$, $B^{-1}=\int_{-\infty}^{\infty} e^{-x^2/2}f(x)\,dx$ and write

\[ e^{-(x-t/2)^2}f(x) = B^{-1}\sqrt{2\pi}\,e^{t^2/4}\left(\frac{1}{\sqrt{2\pi}}e^{-(t-x)^2/2}\right)\left[Be^{-x^2/2}f(x)\right]. \]

Note that $AB/(\sqrt{2\pi}\,e^{t^2/4})$ is the density of $X+Y$ at $t$, where $X$ has the standard normal distribution, $Y$ is continuous with density function $Be^{-x^2/2}f(x)$ and $X,Y$ are independent. This implies $AB=1/\sqrt{2}$ and

\[ e^{-u^2} = \mathbb{E}(e^{iu(X+Y)}) = e^{-u^2/2}\,\mathbb{E}(e^{iuY}), \quad \forall u\in\mathbb{R}. \]

Clearly, $Y$ has the standard normal distribution and, thus, $f$ is a constant a.e. □



3.5. A remark on the spectral gap and mixing time. In this subsection, we make a comparison of spectral gaps between continuous time chains and $\delta$-lazy discrete time chains. Let $(\Omega,K,\pi)$ be an irreducible and reversible finite Markov chain with spectral gap $\lambda$, the smallest non-zero eigenvalue of $I-K$. First, we consider the continuous time case. Since $(K,\pi)$ is reversible, there is a function $f$ defined on $\Omega$ such that $Kf=(1-\lambda)f$. This implies

\[ d^{(c)}_{\mathrm{TV}}(t) = \frac{1}{2}\|H_t-\pi\|_{\infty\to\infty} \ge \frac{\|(H_t-\pi)f\|_\infty}{2\|f\|_\infty} = \frac{e^{-\lambda t}}{2}, \]

where $\|A\|_{\infty\to\infty}:=\sup\{\|Ag\|_\infty : \|g\|_\infty=1\}$. Consequently, we obtain

\[ T^{(c)}_{\mathrm{TV}}(\epsilon) \ge \frac{-\log(2\epsilon)}{\lambda}. \]

For the lazy discrete time case, a similar discussion yields

\[ d^{(\delta)}_{\mathrm{TV}}(t) \ge \beta_\delta^t/2, \qquad T^{(\delta)}_{\mathrm{TV}}(\epsilon) \ge \frac{\log(2\epsilon)}{\log\beta_\delta}, \]

where $\beta_\delta$ is the largest absolute value among the nontrivial eigenvalues of $K_\delta$. By setting $\delta_0=\inf\{\delta\in(0,1) \mid \beta_\delta=1-(1-\delta)\lambda\}$, it is easy to see that $\delta_0\le 1/2$ and, for $\delta\in[\delta_0,1)$, $\beta_\delta=1-(1-\delta)\lambda$. As a function of $\delta$, $\beta_\delta$ is decreasing on $(0,\delta_0)$ and increasing on $(\delta_0,1)$. Note that $|1-(1-\delta)\lambda|\le\beta_\delta\le\max\{1-2\delta,1-(1-\delta)\lambda\}$. The first inequality implies $1-\beta_\delta\le(1-\delta)\lambda$. Using the second inequality, if $\beta_\delta>1-2\delta$, then $1-\beta_\delta=(1-\delta)\lambda$. If $\beta_\delta\le 1-2\delta$, then $1-\beta_\delta\ge 2\delta\ge\delta\lambda$, where the last inequality uses the fact $\lambda\le 2$. We summarize the discussion in the following lemma.

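Lemma 3.10. Let $K$ be an irreducible transition matrix on a finite set $\Omega$ with stationary distribution $\pi$. For $\delta\in(0,1)$, let $K_\delta$ be the $\delta$-lazy walk given by (1.1). Suppose $(\pi,K)$ is reversible, that is, $\pi(x)K(x,y)=\pi(y)K(y,x)$ for all $x,y\in\Omega$ and let $\lambda$ be the smallest non-zero eigenvalue of $I-K$ and $\beta_\delta$ be the largest absolute value of all nontrivial eigenvalues of $K_\delta$. Then, it holds true that

\[ \min\{1-\delta,\delta\}\lambda \le 1-\beta_\delta \le 1-|1-(1-\delta)\lambda| \le (1-\delta)\lambda, \quad \forall\delta\in(0,1). \]

Furthermore, for $\epsilon\in(0,1/2)$,

\[ T^{(c)}_{\mathrm{TV}}(\epsilon) \ge \frac{-\log(2\epsilon)}{\lambda}, \qquad T^{(\delta)}_{\mathrm{TV}}(\epsilon) \ge \left\lfloor\frac{\log(2\epsilon)}{\log\beta_\delta}\right\rfloor \ge \left\lfloor\frac{-\log(2\epsilon)}{2\max\{1-\delta,\log(2/\delta)\}\lambda}\right\rfloor, \]

where the last inequality assumes $|\Omega|\ge 2/\delta$.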
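The two-sided bound on $1-\beta_\delta$ can be checked numerically on any reversible chain whose spectrum is known. The sketch below uses the Ehrenfest spectrum $1-2i/n$ of $K$ quoted in Remark 3.1; the function name, the choice $n=10$ and the grid of $\delta$ values are illustrative assumptions.

```python
def lazy_gap_bounds(k_eigs, delta):
    """Return (min{1-delta, delta} * lam, 1 - beta_delta, (1 - delta) * lam).

    k_eigs: all eigenvalues of a reversible K, with the trivial eigenvalue 1
    listed exactly once. The eigenvalues of K_delta are delta + (1 - delta) * theta.
    """
    nontrivial = sorted(k_eigs)[:-1]           # drop the trivial eigenvalue 1
    lam = 1.0 - max(nontrivial)                # spectral gap of K
    beta = max(abs(delta + (1.0 - delta) * theta) for theta in nontrivial)
    return min(1.0 - delta, delta) * lam, 1.0 - beta, (1.0 - delta) * lam

n = 10  # hypothetical size
ehrenfest_eigs = [1.0 - 2.0 * i / n for i in range(n + 1)]

checks = []
for delta in (0.05, 0.1, 0.3, 0.5, 0.7, 0.9):
    low, gap, high = lazy_gap_bounds(ehrenfest_eigs, delta)
    # small tolerance because the bounds are attained for some delta
    checks.append(low <= gap + 1e-12 and gap <= high + 1e-12)
```

For this periodic chain the eigenvalue $-1$ of $K$ makes $\beta_\delta$ equal to $|2\delta-1|$ for small $\delta$, which is the regime where $1-\beta_\delta$ falls strictly below $(1-\delta)\lambda$.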

Proof. It remains to prove the second inequality in the lower bound of the mixing time for the $\delta$-lazy chain. Note that if $\lambda\le 1/2$, then

\[ -\log\beta_\delta \le -\log(1-(1-\delta)\lambda) \le 2(1-\delta)\lambda, \]

where the last inequality uses the fact $\log(1-x)\ge -2x$ for $x\in(0,1/2)$. For $\lambda>1/2$, let $\theta_1(\delta),...,\theta_{|\Omega|}(\delta)$ be eigenvalues of $K_\delta$. Then, $\theta_i(\delta)=\delta+(1-\delta)\theta_i(0)$ and $\sum_{i=1}^{|\Omega|}\theta_i(0)\ge 0$. See [12] for a reference on the second inequality. This implies

\[ 1+(|\Omega|-1)\beta_\delta \ge \sum_{i=1}^{|\Omega|}\theta_i(\delta) \ge |\Omega|\delta. \]

Assuming $|\Omega|\ge 2/\delta$, the above inequality yields

\[ \beta_\delta \ge \frac{|\Omega|\delta-1}{|\Omega|-1} \ge \frac{\delta}{2}, \qquad -\log\beta_\delta \le \left(2\log\frac{2}{\delta}\right)\lambda. \]

□

4. Finite birth and death chains



In this section, we consider the total variation cutoff for birth and death chains. A birth and death chain on $\{0,1,...,n\}$ with birth rate $p_i$, death rate $q_i$ and holding rate $r_i$ is a Markov chain with transition matrix $K$ given by

\[ K(i,i+1)=p_i, \quad K(i,i-1)=q_i, \quad K(i,i)=r_i, \quad \forall\, 0\le i\le n, \tag{4.1} \]

where $p_i+q_i+r_i=1$ and $p_n=q_0=0$. It is obvious that $K$ is irreducible if and only if $p_iq_{i+1}>0$ for $0\le i<n$. Under the assumption of irreducibility, the unique stationary distribution $\pi$ of $K$ is given by $\pi(i)=c(p_0\cdots p_{i-1})/(q_1\cdots q_i)$, where $c$ is a positive constant such that $\sum_{i=0}^{n}\pi(i)=1$.
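The product formula for $\pi$ can be verified exactly on a small example. The following sketch builds $K$ from (4.1) and checks $\pi K=\pi$ with rational arithmetic; the helper names and the choice of the Ehrenfest rates with $n=4$ are illustrative assumptions.

```python
from fractions import Fraction

def stationary(p, q):
    """pi(i) proportional to (p_0 ... p_{i-1}) / (q_1 ... q_i), normalised to sum 1."""
    n = len(p) - 1
    weights = [Fraction(1)]
    for i in range(1, n + 1):
        weights.append(weights[-1] * p[i - 1] / q[i])
    total = sum(weights)
    return [w / total for w in weights]

def transition_matrix(p, q):
    """Tridiagonal K from (4.1) with holding rate r_i = 1 - p_i - q_i."""
    n = len(p) - 1
    K = [[Fraction(0)] * (n + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        if i < n:
            K[i][i + 1] = p[i]
        if i > 0:
            K[i][i - 1] = q[i]
        K[i][i] = 1 - p[i] - q[i]
    return K

# Ehrenfest rates on {0, ..., 4}: p_i = 1 - i/4, q_i = i/4.
n = 4
p = [Fraction(n - i, n) for i in range(n + 1)]
q = [Fraction(i, n) for i in range(n + 1)]

pi = stationary(p, q)
K = transition_matrix(p, q)
# Row vector times matrix: (pi K)(j) = sum_i pi(i) K(i, j).
pi_K = [sum(pi[i] * K[i][j] for i in range(n + 1)) for j in range(n + 1)]
```

Here `pi` comes out as the binomial weights $2^{-4}\binom{4}{i}$, in agreement with the stationary distribution quoted for the Ehrenfest chain.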

In the next two subsections, we recall some results developed in [9, 10] and make an improvement on them using the result in Section 3. In the third subsection, we go back to the issue of cutoffs and make a comparison of total variation and separation cutoffs.

4.1. The total variation cutoff. Throughout this subsection, we let

\[ \mathcal{F}=\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\} \tag{4.2} \]

denote a family of irreducible birth and death chains with $\Omega_n=\{0,1,...,n\}$ and transition matrix

\[ K_n(i,i+1)=p_{n,i}, \quad K_n(i,i-1)=q_{n,i}, \quad K_n(i,i)=r_{n,i}, \quad \forall\, 0\le i\le n, \tag{4.3} \]

where $p_{n,i}+q_{n,i}+r_{n,i}=1$ and $p_{n,n}=q_{n,0}=0$. Write $\lambda_n=\lambda(K_n)$ for the spectral gap of $K_n$. As before, $\mathcal{F}_c$ denotes the family of associated continuous semigroups and, for $\delta\in(0,1)$, $\mathcal{F}_\delta$ denotes the family of $\delta$-lazy chains. Recall one of the main results in [10] as follows.

Theorem 4.1 (Theorems 3-3.1 in [10]). Consider the family in (4.2). For $n\ge 1$, let $\lambda_n$ be the smallest nonzero eigenvalue of $I-K_n$ and let $\beta_{n,\delta}$ be the largest absolute value of all nontrivial eigenvalues of $K_{n,\delta}$. Then, $\mathcal{F}_c$ (resp. $\mathcal{F}_\delta$ with $\delta\in(0,1)$) has a total variation cutoff if and only if $T^{(c)}_{n,\mathrm{TV}}(1/4)\lambda_n\to\infty$ (resp. $T^{(\delta)}_{n,\mathrm{TV}}(1/4)(1-\beta_{n,\delta})\to\infty$). Moreover, if $\mathcal{F}_c$ (resp. $\mathcal{F}_\delta$) has a cutoff, then the window has size at most $\sqrt{T^{(c)}_{n,\mathrm{TV}}(1/4)/\lambda_n}$ (resp. $\sqrt{T^{(\delta)}_{n,\mathrm{TV}}(1/4)/(1-\beta_{n,\delta})}$).

Remark 4.1. By Lemma 3.10, the total variation cutoff in the discrete time case is equivalent to $T^{(\delta)}_{n,\mathrm{TV}}(1/4)\lambda_n\to\infty$. By Theorems 3.1, 4.1 and Lemma 3.10, if $\mathcal{F}_c$ or $\mathcal{F}_\delta$ has a cutoff, then the window size is at most $\sqrt{T^{(c)}_{n,\mathrm{TV}}(1/4)/\lambda_n}$ or $\sqrt{T^{(\delta)}_{n,\mathrm{TV}}(1/4)/\lambda_n}$.

Remark 4.2. There are examples with cutoffs, but the order of the optimal window size is less than $\sqrt{T^{(c)}_{n,\mathrm{TV}}(1/4)/\lambda_n}$. See Remark 3.1.

The combination of the above theorem and Theorem 3.1 yields
Theorem 4.2. Referring to Theorem 4.1, the following are equivalent.

(1) $\mathcal{F}_c$ has a total variation cutoff.

(2) $\mathcal{F}_\delta$ has a total variation cutoff.

(3) $\mathcal{F}_c$ has a total variation precutoff.

(4) $\mathcal{F}_\delta$ has a total variation precutoff.

(5) $T^{(c)}_{n,\mathrm{TV}}(\epsilon)\lambda_n\to\infty$ for some $\epsilon\in(0,1)$.

(6) $T^{(\delta)}_{n,\mathrm{TV}}(\epsilon)\lambda_n\to\infty$ for some $\epsilon\in(0,1)$.

Proof of Theorem 4.2. It remains to show (3)$\Rightarrow$(5) and this is given by the inequality $d^{(c)}_{n,\mathrm{TV}}(t)\ge e^{-\lambda_n t}/2$. □

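Theorem 4.3. Consider the family in (4.2). It holds true that $T^{(\delta)}_{n,\mathrm{TV}}(\epsilon/2)\asymp T^{(c)}_{n,\mathrm{TV}}(\eta/2)$ for all $\epsilon,\eta,\delta\in(0,1)$. Furthermore, if there is $\epsilon_0\in(0,1)$ such that $T^{(c)}_{n,\mathrm{TV}}(\epsilon_0/2)\lambda_n$ or $T^{(\delta)}_{n,\mathrm{TV}}(\epsilon_0/2)\lambda_n$ is bounded, then $T^{(c)}_{n,\mathrm{TV}}(\epsilon/2)\asymp 1/\lambda_n$ and $T^{(\delta)}_{n,\mathrm{TV}}(\epsilon/2)\asymp 1/\lambda_n$ for all $\epsilon,\delta\in(0,1)$.

Proof of Theorem 4.3. Assume that there is a subsequence $n_k$ and $\epsilon,\eta\in(0,1/2)$ such that either $T^{(c)}_{n_k,\mathrm{TV}}(\epsilon)/T^{(\delta)}_{n_k,\mathrm{TV}}(\eta)\to\infty$ or $T^{(\delta)}_{n_k,\mathrm{TV}}(\eta)/T^{(c)}_{n_k,\mathrm{TV}}(\epsilon)\to\infty$. By Lemma 3.10, we have $T^{(c)}_{n_k,\mathrm{TV}}(\epsilon)\lambda_{n_k}\to\infty$ or $T^{(\delta)}_{n_k,\mathrm{TV}}(\eta)\lambda_{n_k}\to\infty$. In either case, Theorems 3.1 and 4.1 imply that the subfamily indexed by $(n_k)_{k=1}^{\infty}$ has a cutoff in both continuous time and $\delta$-lazy discrete time cases. As a consequence of Theorem 3.1, we obtain $T^{(c)}_{n_k,\mathrm{TV}}(\epsilon)\sim(1-\delta)T^{(\delta)}_{n_k,\mathrm{TV}}(\eta)$, which contradicts the assumption. □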

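Concerning the window size, a combination of Theorem 3.4 and Theorem 4.1 yields

Theorem 4.4. Let $\mathcal{F},\lambda_n$ be as in Theorem 4.1. Suppose that $\mathcal{F}_c$ or $\mathcal{F}_\delta$ has a total variation cutoff and $\lambda_n\asymp 1$. Then, for any $\epsilon,\eta\in(0,1)$ with $\epsilon\ne\eta$,

\[ \left|T^{(c)}_{n,\mathrm{TV}}(\epsilon)-T^{(c)}_{n,\mathrm{TV}}(\eta)\right| \asymp \sqrt{T^{(c)}_{n,\mathrm{TV}}(\epsilon)} \asymp \left|T^{(\delta)}_{n,\mathrm{TV}}(\epsilon)-T^{(\delta)}_{n,\mathrm{TV}}(\eta)\right|. \]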

4.2. The separation cutoff. In this subsection, we apply the results obtained in the previous subsection to the separation cutoff. First, we give a definition of the separation distance. Given an irreducible finite Markov chain $K$ on $\Omega$ with initial distribution $\mu$ and stationary distribution $\pi$, the separation distance at time $m$ is defined by

\[ d_{\mathrm{sep}}(\mu,m) := \max_{x\in\Omega}\left\{1-\frac{\mu K^m(x)}{\pi(x)}\right\}. \]

Aldous and Diaconis [5] introduce the concept of the strong stationary time to identify the separation distance. Set $d_{\mathrm{sep}}(m)=\max_i d_{\mathrm{sep}}(i,m)$. A well-known bound on the separation is achieved by Aldous and Fill in Lemma 7 of [1, Chapter 4], which says

\[ \bar{d}(m)\le d_{\mathrm{sep}}(m), \qquad d_{\mathrm{sep}}(2m)\le 1-(1-\bar{d}(m))^2, \tag{4.4} \]

where $\bar{d}(m):=\max_{i,j}\|K^m(i,\cdot)-K^m(j,\cdot)\|_{\mathrm{TV}}$. It is clear from the definitions that $d_{\mathrm{TV}}(m)\le\bar{d}(m)\le 2d_{\mathrm{TV}}(m)$. Let $T_{\mathrm{sep}}(\epsilon)$ be the separation mixing time. The above inequalities imply

\[ T_{\mathrm{TV}}(\epsilon)\le T_{\mathrm{sep}}(\epsilon)\le 2T_{\mathrm{TV}}(\epsilon/4), \quad \forall\epsilon\in(0,1). \tag{4.5} \]

Note that the above discussions are also valid for the continuous time case. As the separation distance takes values in $[0,1]$, the separation cutoff is defined similarly to the total variation cutoff in Definition 2.1. By (4.5), we obtain the following lemma.
Lemma 4.5. Let $\mathcal{F}$ be a family of finite Markov chains in either discrete or continuous time case. Assume that $T_{n,\mathrm{TV}}(\epsilon)\to\infty$ or $T_{n,\mathrm{sep}}(\epsilon)\to\infty$ for some $\epsilon\in(0,1)$ in discrete time case. Then, $\mathcal{F}$ has a total variation precutoff if and only if $\mathcal{F}$ has a separation precutoff.

For birth and death chains, the application of (4.5) to Theorem 4.3 leads to the following theorem.

Theorem 4.6. Theorem 4.3 also holds in separation. Furthermore, for $\epsilon,\eta\in(0,1/2)$, $T^{(c)}_{n,\mathrm{TV}}(\epsilon)\asymp T^{(\delta)}_{n,\mathrm{sep}}(\eta)$.

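Let $K$ be an irreducible birth and death chain on $\{0,1,...,n\}$ with stationary distribution $\pi$. The authors in [10] obtain the following fact

\[ d^{(c)}_{\mathrm{sep}}(t) = 1-\frac{H_t(0,n)}{\pi(n)}, \qquad d^{(\delta)}_{\mathrm{sep}}(m) = 1-\frac{K_\delta^m(0,n)}{\pi(n)} \quad \forall\delta\in[1/2,1). \tag{4.6} \]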

The authors in [5] provide a criterion on the separation cutoff for continuous time chains and monotone discrete time chains. The result says that a separation cutoff exists if and only if the product of the spectral gap and the separation mixing time tends to infinity. The next theorem is a consequence of this fact and Theorems [ and 4.6, which is also obtained in [10].



Theorem 4.7. Let $\mathcal{F}$ be a family of birth and death chains given by (4.2). The following are equivalent.

(1) $\mathcal{F}_c$ has a cutoff in total variation.

(2) For $\delta\in(0,1)$, $\mathcal{F}_\delta$ has a cutoff in total variation.

(3) $\mathcal{F}_c$ has a cutoff in separation.

(4) For $\delta\in[1/2,1)$, $\mathcal{F}_\delta$ has a cutoff in separation.

The next theorem is a simple corollary of Theorems 4.2, 4.7 and Lemma 4.5.

Theorem 4.8. Theorem 4.3 also holds in separation distance with $\delta\in[1/2,1)$.



4.3. The cutoff time in total variation and separation. In this subsection, we introduce a spectral representation of the total variation mixing time. Let $K$ be the transition kernel of an irreducible birth and death chain on $\{0,1,...,n\}$. Suppose that $K$ is irreducible with stationary distribution $\pi$ and let $0<\lambda_1\le\cdots\le\lambda_n$ be the non-zero eigenvalues of $I-K$. Consider the continuous time case. Using [5, Theorem 4.1] and [10, Corollary 4.5], we have

\[ d^{(c)}_{\mathrm{sep}}(t) = 1-\frac{H_t(0,n)}{\pi(n)} = 1-\frac{H_t(n,0)}{\pi(0)} = \mathbb{P}(S>t), \]

where $S$ is a sum of $n$ independent exponential random variables with parameters $\lambda_1,...,\lambda_n$. By the one-sided Chebyshev inequality, one has

\[ \mathbb{E}S-\sqrt{\mathrm{Var}(S)/(1/\epsilon-1)} \le T^{(c)}_{\mathrm{sep}}(\epsilon) \le \mathbb{E}S+\sqrt{(1/\epsilon-1)\mathrm{Var}(S)}, \quad \forall\epsilon\in(0,1). \]

Note that

\[ \mathbb{E}S = \sum_{i=1}^{n}\frac{1}{\lambda_i}, \qquad \mathrm{Var}(S) = \sum_{i=1}^{n}\frac{1}{\lambda_i^2} \le (\mathbb{E}S)^2. \]

Clearly, this implies

\[ \left(\frac{\sqrt{1-\epsilon}-\sqrt{\epsilon}}{\sqrt{1-\epsilon}}\right)\sum_{i=1}^{n}\frac{1}{\lambda_i} \le T^{(c)}_{\mathrm{sep}}(\epsilon) \le \left(\frac{\sqrt{\epsilon}+\sqrt{1-\epsilon}}{\sqrt{\epsilon}}\right)\sum_{i=1}^{n}\frac{1}{\lambda_i}, \quad \forall\epsilon\in(0,1). \tag{4.7} \]

The above equation says that, given $\epsilon\in(0,1/2)$, the separation mixing time is bounded by $\sum_{i=1}^{n}\lambda_i^{-1}$ up to universal constants. The above discussion is also valid for the discrete time case with the assumption that $K(i,i+1)+K(i+1,i)\le 1$ for $0\le i<n$. See [5] for the details. The next proposition is an application of (4.5) and (4.7).

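Proposition 4.9. Let $K$ be an irreducible birth and death chain on $\{0,1,...,n\}$. Let $0,\lambda_1,...,\lambda_n$ be eigenvalues of $I-K$ and set $s=\sum_{i=1}^{n}\lambda_i^{-1}$. Then,

\[ \left(\frac{\sqrt{1-\epsilon}-\sqrt{\epsilon}}{\sqrt{1-\epsilon}}\right)s \le T^{(c)}_{\mathrm{sep}}(\epsilon) \le \left(\frac{\sqrt{\epsilon}+\sqrt{1-\epsilon}}{\sqrt{\epsilon}}\right)s, \quad \forall\epsilon\in(0,1/2), \]

and

\[ \frac{1}{2}\left(\frac{\sqrt{1-4\epsilon}-\sqrt{4\epsilon}}{\sqrt{1-4\epsilon}}\right)s \le T^{(c)}_{\mathrm{TV}}(\epsilon) \le \left(\frac{\sqrt{\epsilon}+\sqrt{1-\epsilon}}{\sqrt{\epsilon}}\right)s, \quad \forall\epsilon\in(0,1/8). \]

The above also holds in discrete time case with the assumption that $K(i,i+1)+K(i+1,i)\le 1$ for $0\le i<n$.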
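The separation bounds can be compared with the exact mixing time whenever the non-zero eigenvalues are distinct, since then $\mathbb{P}(S>t)$ has the closed form $\sum_j\big(\prod_{k\ne j}\lambda_k/(\lambda_k-\lambda_j)\big)e^{-\lambda_j t}$. The sketch below does this for the continuous time Ehrenfest chain with $n=3$, whose eigenvalues $2/3, 4/3, 2$ are taken from Remark 3.1; the function names and the bisection tolerance are illustrative assumptions.

```python
import math

def survival(eigs, t):
    """P(S > t) for S a sum of independent exponentials with distinct rates eigs."""
    total = 0.0
    for j, lj in enumerate(eigs):
        coeff = 1.0
        for k, lk in enumerate(eigs):
            if k != j:
                coeff *= lk / (lk - lj)
        total += coeff * math.exp(-lj * t)
    return total

def separation_mixing_time(eigs, eps, tol=1e-10):
    """Smallest t with P(S > t) <= eps, located by bisection (P(S > t) is decreasing)."""
    lo, hi = 0.0, 1.0
    while survival(eigs, hi) > eps:
        hi *= 2.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if survival(eigs, mid) > eps:
            lo = mid
        else:
            hi = mid
    return hi

def separation_bounds(eigs, eps):
    """Lower and upper bounds on the separation mixing time, for eps in (0, 1/2)."""
    s = sum(1.0 / x for x in eigs)
    lower = (math.sqrt(1 - eps) - math.sqrt(eps)) / math.sqrt(1 - eps) * s
    upper = (math.sqrt(eps) + math.sqrt(1 - eps)) / math.sqrt(eps) * s
    return lower, upper

eigs = [2.0 / 3.0, 4.0 / 3.0, 2.0]   # Ehrenfest chain, n = 3
eps = 0.25
t_sep = separation_mixing_time(eigs, eps)
lower, upper = separation_bounds(eigs, eps)
```

For these rates $S$ has the law of the maximum of three independent exponentials of rate $2/3$, so `t_sep` can also be cross-checked against $-\frac{3}{2}\log(1-0.75^{1/3})$.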

Applying Proposition 4.9 to Theorems 4.2, 4.7 yields the following theorem, where the result in separation is included in [9] and the result in total variation is implicitly obtained in [10].

Theorem 4.10 (Cutoffs from the spectrum). Let $\mathcal{F}$ be the family in (4.2). For $n\ge 1$, let $\lambda_{n,1},...,\lambda_{n,n}$ be non-zero eigenvalues of $I-K_n$ and set

\[ \lambda_n = \min_{1\le i\le n}\lambda_{n,i}, \qquad s_n = \frac{1}{\lambda_{n,1}}+\cdots+\frac{1}{\lambda_{n,n}}. \]

Then, the following are equivalent.

(1) $\mathcal{F}_c$ has a total variation cutoff.

(2) For $\delta\in(0,1)$, $\mathcal{F}_\delta$ has a total variation cutoff.

(3) $\mathcal{F}_c$ has a total variation precutoff.

(4) For $\delta\in(0,1)$, $\mathcal{F}_\delta$ has a total variation precutoff.

(5) $s_n\lambda_n\to\infty$.

The above also holds in separation with $\delta\in[1/2,1)$. In particular, if (5) holds, then, for $\epsilon\in(0,1)$,

\[ \frac{1}{2} \le \liminf_{n\to\infty}\frac{T^{(c)}_{n,\mathrm{TV}}(\epsilon)}{s_n} \le \limsup_{n\to\infty}\frac{T^{(c)}_{n,\mathrm{TV}}(\epsilon)}{s_n} \le 1. \]

The last result establishes a relation between the mixing time and birth and death rates. Consider an irreducible birth and death chain $(X_m)_{m=0}^{\infty}$ on $\{0,1,...,n\}$ with transition matrix $K$ and stationary distribution $\pi$. Let $N_t$ be a Poisson process of parameter 1 that is independent of $X_m$ and set, for $0\le i\le n$,

\[ \tau_i := \inf\{t\ge 0 \mid X_{N_t}=i\}. \]

Brown and Shao discuss the distribution of $\tau_i$ in [3] and obtain the following result.

\[ \mathbb{P}_0(\tau_n>t) = \sum_{j=1}^{n}\left(\prod_{k\ne j}\frac{\theta_k}{\theta_k-\theta_j}\right)e^{-\theta_j t}, \]

where $\mathbb{P}_i$ is the conditional probability given $X_0=i$ and $\theta_1,...,\theta_n$ are eigenvalues of the submatrix of $I-K$ indexed by $\{0,1,...,n-1\}$. Let $\mathbb{E}_i$ be the conditional expectation given $X_0=i$. Clearly, this implies $\mathbb{E}_0\tau_n=\sum_{i=1}^{n}1/\theta_i$. Note that $\mathbb{E}_0\tau_n$ can be formulated by the birth and death rates using the strong Markov property. This leads to

\[ \mathbb{E}_0\tau_n = \sum_{j=1}^{n}\frac{1}{\theta_j} = \sum_{k=0}^{n-1}\frac{\pi([0,k])}{\pi(k)p_k}, \tag{4.8} \]

where $\pi(A):=\sum_{i\in A}\pi(i)$.
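The right-hand side of (4.8) can be checked against a direct first-step computation. With a rate 1 Poisson clock the expected hitting time equals the expected number of discrete steps, and $e_k=\mathbb{E}_k\tau_{k+1}$ satisfies $e_0=1/p_0$ and $e_k=(1+q_ke_{k-1})/p_k$. The following sketch compares the two exactly; the function names and the sample rates are hypothetical.

```python
from fractions import Fraction

def stationary(p, q):
    """pi(i) proportional to (p_0 ... p_{i-1}) / (q_1 ... q_i)."""
    n = len(p) - 1
    weights = [Fraction(1)]
    for i in range(1, n + 1):
        weights.append(weights[-1] * p[i - 1] / q[i])
    total = sum(weights)
    return [w / total for w in weights]

def hitting_time_by_formula(p, q):
    """Sum over k = 0..n-1 of pi([0, k]) / (pi(k) p_k), as in (4.8)."""
    pi = stationary(p, q)
    n = len(p) - 1
    return sum(sum(pi[:k + 1]) / (pi[k] * p[k]) for k in range(n))

def hitting_time_by_recursion(p, q):
    """E_0 tau_n from first-step analysis: e_k = (1 + q_k e_{k-1}) / p_k."""
    n = len(p) - 1
    step = Fraction(1) / p[0]        # e_0 = E_0 tau_1
    total = step
    for k in range(1, n):
        step = (1 + q[k] * step) / p[k]
        total += step
    return total

# Hypothetical chain on {0, 1, 2, 3} with positive holding rates.
p = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 4), Fraction(0)]
q = [Fraction(0), Fraction(1, 3), Fraction(1, 2), Fraction(1, 2)]

by_formula = hitting_time_by_formula(p, q)
by_recursion = hitting_time_by_recursion(p, q)
```

Using `Fraction` keeps both computations exact, so the comparison is an equality rather than a floating point tolerance check.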

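Fix $0\le i_0\le n$. By (4.8), we have

\[ \mathbb{E}_0\tau_{i_0} = \sum_{i=1}^{i_0}\frac{1}{\lambda_i'}, \qquad \mathbb{E}_n\tau_{i_0} = \sum_{k=1}^{n-i_0}\frac{1}{\lambda_k''}, \]

where $\lambda_1',...,\lambda_{i_0}'$ and $\lambda_1'',...,\lambda_{n-i_0}''$ are eigenvalues of the submatrices of $I-K$ indexed respectively by $\{0,...,i_0-1\}$ and $\{i_0+1,...,n\}$. Let $\tilde\lambda_1\le\cdots\le\tilde\lambda_n$ be a rearrangement of $\lambda_1',...,\lambda_{i_0}',\lambda_1'',...,\lambda_{n-i_0}''$. Clearly, $\tilde\lambda_1,...,\tilde\lambda_n$ are eigenvalues of the submatrix obtained by removing the $i_0$-th row and the $i_0$-th column of $I-K$. Let $\lambda_1\le\cdots\le\lambda_n$ be the nonzero eigenvalues of $I-K$. By Theorem 4.3.8 in [11], we have $\tilde\lambda_i\le\lambda_i\le\tilde\lambda_{i+1}$ and this leads to

\[ \sum_{i=2}^{n}\frac{1}{\tilde\lambda_i} \le \sum_{i=1}^{n}\frac{1}{\lambda_i} \le \sum_{i=1}^{n}\frac{1}{\tilde\lambda_i} = \sum_{k=0}^{i_0-1}\frac{\pi([0,k])}{\pi(k)p_k}+\sum_{k=i_0+1}^{n}\frac{\pi([k,n])}{\pi(k)q_k}, \]

where the equality uses (4.8). By Proposition 4.9, we obtain, for $\epsilon\in(0,1)$,

\[ T^{(c)}_{\mathrm{TV}}(\epsilon) \le T^{(c)}_{\mathrm{sep}}(\epsilon) \le \left(\frac{\sqrt{\epsilon}+\sqrt{1-\epsilon}}{\sqrt{\epsilon}}\right)\min_{0\le i\le n}\left\{\sum_{k=0}^{i-1}\frac{\pi([0,k])}{\pi(k)p_k}+\sum_{k=i+1}^{n}\frac{\pi([k,n])}{\pi(k)q_k}\right\}. \]

The above discussion also holds in the discrete time case with the assumption that $p_i+q_{i+1}\le 1$ for all $0\le i<n$. This includes the $\delta$-lazy chain for $\delta\in[1/2,1)$ and we apply it to get the following corollary.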

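Corollary 4.11. Let $\mathcal{F}=\{(\Omega_n,K_n,\pi_n) \mid n=1,2,...\}$ be a family of irreducible birth and death chains in (4.2) with birth, death and holding rates $p_{n,i},q_{n,i},r_{n,i}$. For $n\ge 1$, set

\[ t_n = \min_{0\le i\le n}\left\{\sum_{k=0}^{i-1}\frac{\pi_n([0,k])}{\pi_n(k)p_{n,k}}+\sum_{k=i+1}^{n}\frac{\pi_n([k,n])}{\pi_n(k)q_{n,k}}\right\}. \]

If $\mathcal{F}_c$ or $\mathcal{F}_\delta$ has a total variation cutoff, then, for $\epsilon\in(0,1)$ and $\delta\in[1/2,1)$,

\[ \limsup_{n\to\infty}\frac{T^{(c)}_{n,\mathrm{sep}}(\epsilon)}{t_n}\le 1, \qquad \limsup_{n\to\infty}\frac{T^{(\delta)}_{n,\mathrm{sep}}(\epsilon)}{t_n}\le\frac{1}{1-\delta}, \]

and, for $\epsilon\in(0,1)$,

\[ \limsup_{n\to\infty}\frac{T^{(c)}_{n,\mathrm{TV}}(\epsilon)}{t_n}\le 1, \qquad \limsup_{n\to\infty}\frac{T^{(\delta)}_{n,\mathrm{TV}}(\epsilon)}{t_n}\le\frac{1}{1-\delta}. \]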

Remark 4.3. In [B], the constant $t_n$ in Corollary 4.11 is proved to be of the same order as the constant $s_n$ in Theorem 4.10 and the following term

\[ \sum_{k=0}^{i_n-1}\frac{\pi_n([0,k])}{\pi_n(k)p_{n,k}}+\sum_{k=i_n+1}^{n}\frac{\pi_n([k,n])}{\pi_n(k)q_{n,k}}, \]

where $i_n$ satisfies $\pi_n([0,i_n])\ge 1/2$ and $\pi_n([i_n,n])\ge 1/2$.

Remark 4.4. The bound in Corollary 4.11 is also obtained implicitly in [10] using a coupling argument.